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DATE:  30  November  1989 

ANNUAL  REPORT  ON  CONTRACT  N00014-89-J-1174  R&T  CODE  441n011 

PRINCIPAL  INVESTIGATOR:  Elizabeth  D.  GetzofF 

CONTRACTOR:  Scripps  Clinic  and  Research  Foundation 

CONTRACT  TITLE:  Defining  Protein  Electrostatic  Recognition  Processes 

START  DATE:  1  December  1988 

RESEARCH  OBJECTIVE:  The  design  and  application  of  a  tightly  coupled  computational  and 
interactive  computer  graphics  approach  to  elucidate  the  nature  of  electrostatic  forces  controlling 
the  interaction  of  three  general  classes  of  electrostatic  recognition  processes:  stable,  catalytic  and 
transient  binding  complexes. 

PROGRESS  (Year  1): 

Precollision  Orientation  by  Electrostatic  Forces.  Attempts  to  define  macromolecular  complexes 
by  docking  static  crystallographic  structures  may  fail  because  local  structural  changes  are  not 
accommodated.  We  have  approached  the  problem  of  protein-protein  interaction  by  considering  the 
relative  orientations  of  the  two  molecules  at  a  separat.^n  distance  where  the  precollision  orientation 
is  dictated  by  long-range  electrostatic  forces. 

To  search  for  all  possible  favorable  energy  orientations  of  two  molecules,  two  problems  needed 
to  be  overcome.  First,  the  search  must  be  computationally  efficient,  and,  second,  the  molecules 
must  be  maintained  at  a  constant  minimum  distance.  The  first  problem  was  solved  by  calculating 
an  electrostatic  grid  about  one  molecule  and  using  the  grid  to  calculate  the  total  electrostatic 
interaction  with  the  other  molecule  in  its  many  orientations.  The  second  problem  was  solved 
by  making  and  aligning  expanded  molecular  dot  surfaces  for  each  molecule  and  checking  these 
surfaces  for  interpenetration.  The  program  TURNIP  used  these  methods  to  carry  out  a  systematic 
six  dimensional  search  of  paired  molecular  orientations  evaluated  by  their  resulting  electrostatic 
energy. 

In  the  first  year  of  this  contract,  we  have  developed  and  optimized  the  TURNIP  program,  which 
consists  of  two  parts.  First,  the  electrostatic  potential  due  to  molecule  1,  which  remains  fixed, 
was  calculated  onto  a  grid  that  was  large  enough  to  encompass  all  positioning  of  molecule  2  with 
its  expanded  surface.  Second,  molecule  2  was  then  moved  about  molecule  1  by  systematically 
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matching  one  expanded  surface  point  from  each  molecule  along  the  intermolecular  axis  (a  line 
between  the  centers  of  coordinates  of  the  two  molecules).  For  each  matched  pair  of  points,  molecule 
2  was  rotated  in  10°  increments  about  the  intermolecular  axis.  For  each  orientation,  in  which  the 
expanded  surfaces  interpenetrated,  molecule  2  was  incrementally  translated  a  small  distance  away 
from  molecule  1  along  the  intermolecular  axis  and  retested  for  penetration.  For  each  position  of 
molecule  2,  the  electrostatic  potential  energy  was  calculated  as  the  sum  of  the  charges  of  each 
atom  of  molecule  2  multiplied  by  the  potential  at  its  nearest  grid  point,  which  represented  the 
electrostatic  potential  of  molecule  1.  The  calculation  of  the  electrostatic  potential  from  molecule 
1  at  the  grid  points  is  proportional  to  the  number  of  atoms  of  1,  while  the  calculation  of  the 
intermolecular  energies  is  proportional  to  the  number  of  atoms  in  molecule  2.  Thus,  the  grid  saves 
substantial  calculation  time  compared  with  a  pairwise  calculation  for  each  orientation,  which  would 
be  proportional  to  the  number  of  atoms  in  molecule  1  times  the  number  of  atoms  in  molecule  2. 
This  calculation  constitutes  a  full  search  over  the  six  degrees  of  freedom  maintaining  a  constant 
distance  between  the  two  molecules. 

Because  the  program  is  divided  into  two  parts,  each  part  can  be  run  on  computers  with  speci¬ 
fications  that  match  the  task  at  hand.  For  a  complete  search  involving  two  ~100-residue  proteins, 
the  energy  grid  is  sufficiently  large  that  it  cannot  be  held  in  memory  on  our  Cray  XMP  at  Scripps 
Clinic.  Instead,  the  electrostatic  potential  grid  is  created  on  the  Cray  and  written  to  a  file  that  is 
then  transferred  to  our  Convex  C2  supercomputer  where  the  rotation  and  translation  of  molecule  2 
occurs.  The  program  TURNIP  is  written  in  the  programming  language  C,  and  has  been  optimized 
to  take  advantage  of  the  capabilities  of  each  machine.  In  the  last  year,  the  C  compiler  for  the 
Cray  has  been  greatly  improved.  Together  with  vectorization  and  optimization  of  the  code,  this  has 
improved  the  speed  by  more  than  a  factor  of  20.  Calculation  of  the  electrostatic  grid  is  currently 
done  with  a  Coulomb  potential  and  takes  about  an  hour  of  CPU  time  on  the  Cray  supercomputer  to 
generate  a  grid  with  spacing  of  0.5A  large  enough  to  maintain  a  sep  uticn  distance  of  6A  between 
two  ~100-residue  proteins  in  all  orientations.  More  complex  electrosiu.  methods  could  easily  be 
implemented  and  would  run  efficiently  on  this  machine.  The  virtual  memory  of  the  Convex  allows 
the  entire  grid  to  be  read  in  at  once.  A  complete  search  of  all  orientations  of  two  ~100-residue 
proteins  takes  about  28  hours  of  CPU  time.  A  coarser  grid  or  a  restricted  rotational  space  can 
substantially  reduce  the  time  necessary  for  either  part  of  the  program. 

We  have  applied  TURNIP  to  the  investigation  of  the  transient  interaction  between  two  electron 
transfer  proteins,  plastocyanin  and  cytochrome  c,  and  have  compare^  these  results  with  those 


found  by  visual  alignment  of  electrostatic  fields  displayed  with  computer  graphics.  The  computer 
graphics  method  gives  two  favorable  orientations  that  vary  by  a  180°  rotation.  The  comprehensive 
computational  search  methods  provide  a  family  of  orientations  centered  about  those  found  by 
computer  graphics  and  reveal  that  there  are  many  similar  energy  orientations  when  the  molecular 
surfaces  are  separated  by  6  and  12A.  All  orientations  have  the  exposed  heme  edge  of  cytochrome 
c  facing  the  acidic  patch  of  plastocyanin.  This  region  ot  favorable  approach  of  cytochrome  c  is 
consistent  with  experimental  data  showing  that  electron  transfer  proteins  may  meet  in  multiple 
orientations  and  then  translocate  along  the  protein  surfaces  to  reach  an  orientation  appropriate  for 
electron  transfer. 

Analysis  of  the  U10  Mutant  of  the  Phosphorylcholine- Binding  Antibody  SlO 7.  The  U10  antibody 
results  from  a  single-site  mutation  of  heavy  chain  Asp  101  in  the  parent  antibody,  S107,  to  an 
alanine.  The  mutated  residue  is  over  9  A  distant  from  the  antigen  binding  site,  but  results  in 
a  complete  loss  of  phosphorylcholine  binding  activity.  A  combination  of  molecular  dynamics  and 
minimization  techniques  suggested  that  the  mutant  antibody  may  fold  so  that  Arg  94  of  the  heavy 
chain,  which  forms  a  salt-bridge  with  Asp  101  in  a  related  phosphorylcholine-binding  antibody, 
McPC603,  extends  into  the  antigen  binding  pocket  to  electrostatically  and  sterically  block  the 
binding  of  the  hapten.  This  work  has  been  published  in  the  Proceedings  of  the  National  Academy 
of  Science,  USA  (copy  enclosed). 

Interactive  Computer  Graphics.  We  have  coupled  the  molecular  dynamics  programs  AMBER 
and  DISCOVER,  running  either  on  the  Cray  or  the  Convex  supercomputers,  through  a  network  con¬ 
nection  to  our  interactive  graphics  program  Flex  running  on  another  computer.  Now,  dynamics  or 
energy  minimizations  done  on  the  supercomputers  can  be  viewed  interactively  on  Sun  workstations 
while  the  calculations  are  being  performed. 

WORK  PLAN:  In  the  coming  year,  the  methodical  search  program  will  be  applied  to  the  com¬ 
ponents  of  two  protein  systems  of  known  structure,  where  electrostatics  appears  to  play  a  major 
role  in  molecular  recognition  and  orientation:  a  lysozyme/antibody  complex  and  the  tetramer  of 
plasmid  dihydrofolate  reductase.  We  are  also  continuing  development  of  methods  for  visualizing 
and  comparing  the  suggested  multiple  orientations  that  result  from  these  calculations. 
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ABSTRACT  To  study  the  molecular  basis  for  antibody 
diversity  and  the  structural  basis  for  antigen  binding,  we  have 
characterized  the  loss  of  phosphocholine  (P-Cho)  binding  both 
experimentally  and  computationally  in  U10,  a  somatic  mutant  of 
the  antibody  S107.  Nucleotide  sequencing  of  U10  shows  a  single 
base  change  in  JhI,  substituting  Asp-101  with  Ala,  over  9  A 
distant  from  the  P-Cho-binding  pocket.  Probing  with  anti- 
idiotypic  antibodies  suggests  local,  not  global,  conformational 
changes.  Computational  results  support  a  specific  structural 
mechanism  for  the  loss  of  P-Cho  binding.  The  U10  mutation 
eliminates  the  charged  interaction  between  Asp-101  and  Arg-94, 
which  allows  the  Arg-94  side  chain  to  disrupt  P-Cho  binding 
sterically  and  electrostatically  by  folding  into  the  P-Cho-binding 
site.  These  results  specifically  show  the  importance  of  the  Arg-94 
to  Asp-101  side  chain  salt  bridge  in  the  heavy-chain  CDR3 
conformation  and  suggest  that  residues  distant  from  the  binding 
site  play  an  important  role  in  antibody  diversity  and  inducible 
complementarity. 


Somatic  diversity  of  germ-line  immunoglobulin  variable  (V) 
region  genes  during  the  course  of  the  immune  response  can 
alter  the  affinity  and  specificity  of  antibodies  (1-5).  Correla¬ 
tion  of  individual  amino  acid  substitutions  with  changes  in 
antigen  binding  is  often  difficult  because  most  monoclonal 
antibodies  with  significant  changes  in  affinity  differ  by  mul¬ 
tiple  amino  acid  substitutions  in  both  heavy  (H)  and  light  (L) 
chains  (1-6),  To  examine  the  impact  of  somatic  mutation  on 
V  region  structure,  we  have  developed  an  in  vitro  assay  to 
isolate  spontaneously  generated  somatic  mutants  of  anti- 
phosphocholine  (P-Cho)  antibodies  with  altered  specificity 
and  affinity  (7).  The  S107.3.4  (S107)  myeloma  cell  line 
produces  IgA  encoded  by  the  same  germ-line  T15  gene 
segments  used  for  anti-P-Cho  antibodies  in  vivo:  VH1,  diver¬ 
sity  (D)  DFL  16.1,  and  joining  (J)  JH1  for  the  H  chain  (8)  and 
S107A  V.22  and  J.5  for  the  L  chain  (9).  Most  of  our  in  vitro 
mutants  from  S107  have  a  single  amino  acid  substitution  and 
thus  can  be  used  to  study  the  structural  and  molecular  basis 
of  somatic  diversification.  In  addition,  S107  has  high  se¬ 
quence  homology  to  the  P-Cho-binding  antibody  McPC603, 
whose  three-dimensional  structure  has  been  determined  (10). 
Here  we  report  a  mutant  antibody,  U10,  with  a  single  point 
mutation^  in  a  residue  outside  and  distant  from  the  antigen¬ 
binding  site  that  results  in  complete  loss  of  P-Cho  binding. 
The  similarity  of  the  McPC603  and  S107  antibodies  has 
allowed  us  to  use  computational  techniques  to  show  how  this 
mutation  could  cause  major  steric  and  electrostatic  changes 
in  the  antigen-binding  site. 


The  publication  costs  of  this  article  were  defrayed  in  part  by  page  charge 
payment.  This  article  must  therefore  be  hereby  marked  "advertisement" 
in  accordance  with  18  U.S.C.  11734  solely  to  indicate  this  fact. 


MATERIALS  AND  METHODS 

Isolation  and  Purification  of  U10,  S107,  and  U2U4  Proteins. 
The  U10  and  U2U4  mutants  were  isolated  from  the  S107  cell 
line,  and  the  proteins  were  purified  by  ion-exchange  chro¬ 
matography  on  a  DEAE-Sepharose  column  (7).  S107  protein 
was  purified  from  ascites  fluid  by  affinity  chromatography  on 
P-Cho-Sepharose  and  eluted  with  10  mM  P-Cho  (7). 

Fluorescence  Titrations.  Fluorescence  titrations  were  per¬ 
formed  as  described  (11)  in  the  laboratories  of  Fred  Karush 
(University  of  Pennsylvania)  and  Ken  Wu  (Albert  Einstein 
College  of  Medicine)  with  Perkin-Elmer  fluorescent  spectro¬ 
photometers.  Proteins  were  irradiated  at  295  nm,  and  emis¬ 
sions  were  read  at  333  nm. 

Direct  Binding  and  Competition  ELISAs.  Direct  binding 
ELlSAs  were  done  as  described  (12).  The  polymeric  IgA, 
anti-dextran  antibody.  W3129,  served  as  a  negative  control 
(13).  The  amount  of  bound  S107,  U10,  and  W3129  proteins 
was  determined  by  using  alkaline  phosphatase-linked  goat 
anti-mouse  IgA  antisera  (Zymed  Laboratories).  Competition 
ELISAs  were  performed  as  in  ref.  14.  The  amount  of  bound 
rabbit  anti-binding  site  antibody  was  detected  with  enzyme- 
linked  goat  anti-rabbit  IgG  antisera  (Zymed  Laboratories). 

mRNA  Sequencing.  Preparation  and  sequencing  of  U 10  and 
S107  mRNA  was  done  as  described  (15).  H-chain  constant 
(C)  region  1  (CH1)-  and  VH-specific  primers  were  used. 

Computer  Simulation  of  the  ARG-IN  and  ARG-OUT  Mod¬ 
els.  The  program  discover  (a  product  of  Biosym  Technolo¬ 
gies),  with  force-field  parameters  as  in  ref.  16.  was  used  to 
calculate  minimum  energy  structures  and  molecular  dynam¬ 
ics  trajectories.  A  model  of  S107  was  built  from  the  McPC603 
crystal  structure  (10)  by  substitution  of  32  amino  acids  in  the 
VL  chain  that  generally  were  far  from  the  P-Cho-binding  site, 
by  substitution  of  4  amino  acids  in  the  VH  chain,  and  by 
insertion  of  a  Tyr  residue  in  H-chain  CDR3  between  Arg-94 
and  Asp-101.  To  substitute  amino  acids,  original  atomic  side 
chain  positions  were  matched,  and  additional  atoms  were 
built  in  an  extended  conformation.  Hydrogen  atoms  were 
added  geometrically  to  all  crystallographic  or  built  atomic 
coordinates. 

Two  U10  models,  ARG-OUT  and  ARG-IN,  were  built 
starting  from  the  S107  model  with  the  Asp- 101  side  chain 
shortened  to  an  Ala.  Models  consisted  of  VL  Asp-1  to 
Leu-104  and  VH  Glu-1  to  Thr-1 10  with  N-methyl  groups  at  the 
carboxyl  termini.  In  the  ARG-OUT  model,  the  Arg-94  side 
chain  extended  out  to  the  surface  as  in  the  McPC603  structure 


Abbreviations;  P-Cho,  phosphocholine;  V,  variable;  H,  heavy;  L. 
light;  J,  joining;  C,  constant;  D,  diversity. 
tPresent  address:  Department  of  Pediatrics,  National  Jewish  Hos¬ 
pital,  Denver,  CO  80206. 

fThe  sequence  reported  in  this  paper  has  been  deposited  in  the 
GenBank  data  base  (accession  no.  M24452). 
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and  the  S107  model.  To  build  the  ARG-IN  model,  the 
13-residue  fragment  from  Tyr-90  to  Trp- 1001  (numbering  as  in 
ref.  26,  pp.  78  and  204)  was  extracted,  and  one  terminal 
nitrogen  atom,  NH2,  of  the  side  chain  of  Arg-94  was  template 
forced  (17)  to  the  position  of  the  P-Cho  nitrogen  atom  in  the 
McPC603  crystal  structure  while  the  ends  of  the  fragment 
were  fixed  in  position.  The  fragment  was  relaxed  by  1.3  psec 
of  dynamics  with  NH2  still  targeted  and  then  incorporated 
back  into  the  U10  structure  to  give  the  starting  point  for  the 
ARG-IN  model. 

Water  molecules  and  the  sulfate  ion  were  added  to  com¬ 
plete  the  U10  models.  Crystallographic  water  molecules 
within  15  A  of  the  active  site  (defined  as  the  sulfate  ion  and 
all  residues  within  4  A  of  /’-Cho;  i.e.,  VL  Phe-91,  Tyr-92, 
Tyr-94,  and  Leu-96  and  VH  Tyr-33,  Arg-52,  Asp-95,  and 
Trp- 1001)  were  added.  From  a  box  of  water  molecules 
generated  around  each  complex,  water  molecules  were  se¬ 
lected  to  fill  the  regions  within  7  A  of  the  active  site  and  the 
initial  Arg-94  position,  to  fill  the  regions  within  5  A  of  charged 
groups,  and  to  make  a  thin  shell  around  all  antibody  residues 
that  had  any  atom  within  15  A  of  the  active  site.  Hydrogen 
atoms  were  either  built  to  satisfy  a  clear  hydrogen-bonding 
pattern  (crystal  waters  only)  or  built  randomly  to  give  a 
H — O — H  angle  of  104.5°. 

To  gradually  relax  the  U 10  models,  the  nonhydrogen  atoms 
of  the  antibody  were  strongly  template  forced  (force  constant 
of  1000.0  kcal’A"1)  to  their  initial  positions  during  energy 
minimization  (steepest  descent  algorithm)  until  the  maximum 
energy  gradient  was  less  than  10  keal  mol "  '-A  " 1 .  To  complete 
the  minimizations,  all  waters,  the  sulfate  ion,  and  antibody 
residues  within  15  A  of  the  active  site  were  unconstrained 
while  residues  outside  this  area  were  strongly  template  forced 
to  their  initial  positions.  Minimization  to  convergence,  nec¬ 
essary  for  analysis  of  energetics  (18),  used  the  conjugate 
gradient  algorithm  with  the  full  force  field  and  no  nonbond 
cutoff.  The  final  root-mean-square  energy  gradients  were  less 
than  0.05  kcahnor^A'1,  and  the  root-mean-square  devia¬ 
tions  in  position  for  all  nonhydrogen  atoms  during  the  last 
1000  iterations  were  about  0.1  A. 

For  treatment  of  the  electrostatic  potential  in  the  calcula¬ 
tion  of  the  energetics  of  Arg-94,  Asp-95,  and  Glu-35,  a 
macroscopic  dielectric  constant  (e)  was  used.  If  all  of  the 
electrostatic  properties  (predominantly  electronic  polariza¬ 
tion  and  dipoles)  of  the  protein  were  described  microscopi¬ 
cally,  e  would  be  1.  A  dielectric  constant  of  4  has  been 
derived  from  studies  on  crystalline  polyamides  (19),  where 
both  electronic  polarization  and  dipoles  are  included  in  the 
macroscopic  constant.  Because  we  explicitly  include  dipoles 
as  partial  point  charges  on  each  atom,  the  major  contribution 
to  e  should  be  electronic  polarizability,  which  leads  to  a  value 
between  2  and  3  (20).  Therefore,  the  energetics  in  Table  1 
were  calculated  for  both  e  =  1  and  e  =  4  to  bracket  the 
electrostatic  potential  contribution. 

Structural  Analysis.  To  determine  the  range  of  conforma¬ 
tional  variation  reasonable  for  the  U10  mutant,  the  10  VL 
domains  and  7  VH  domains  superimposed  in  ref.  21  were 
compared  with  the  ARG-IN  and  ARG-OUT  models.  Com¬ 
puter  graphics  analysis  of  all  crystallographic  and  model 
structures  used  the  programs  gramps  (22)  and  granny  (23). 
Electrostatic  potential  surfaces  were  calculated  to  identify 
critical  electrostatic  interactions  in  the  P-Cho-binding  site  by 
using  the  programs  espot  and  essurf  (24,  25). 

RESULTS  AND  DISCUSSION 

Isolation  and  Characterization  of  the  U10  Mutant.  The  U10 
mutant  was  isolated  from  S107  based  on  its  lack  of  binding  to 
P-Cho  (7).  Like  S107,  U10  is  a  polymeric  IgA  with  two  to 
three  H2L2  units  per  molecule  and  standard  IgA  interchain 
disulfide  bonding.  The  H  and  L  chains  of  U10  were  indistin- 


P-Cho  x  10® 

Fig.  1.  Comparison  of  trypiophan  fluorescence  enhancement  of 
U10  and  S107  protein  with  P-Cho.  Values  for  fluorescence  enhance¬ 
ment  (E)  of  S107  (a)  compared  with  U10  (O)  are  calculated  and 
corrected  for  dilution  for  20  points  on  the  experimental  curve  derived 
in  ref.  11. 

guishable  in  size  from  those  of  S107  by  SDS/PAGE,  either 
with  or  without  tunicamycin  treatment.  However,  the  U10 
antibody  did  not  bind  /’-Cho  when  assayed  by  equilibrium 
dialysis  or  by  enhancement  of  tryptophan  fluorescence  (Fig. 
1).  An  ELISA,  which  is  very  sensitive  for  polymeric  anti¬ 
bodies  like  S107,  showed  no  binding  of  U10  to  P-Cho,  even 
at  high  antibody  concentrations  (F.g.  2). 

A  Single  Amino  Acid  Change  in  the  U10  VH  Region.  To 
determine  the  structural  basis  for  the  loss  of  antigen  binding 
by  the  U10  antibody,  we  sequenced  and  compared  the  S107 
and  U10  immunoglobulin  VL  and  VH  regions  and  the  contig¬ 
uous  CH1  and  C«  regions  directly  from  mRNA.  The  S107  and 
U10  VH  and  D  region  sequences  were  identical,  but  the  Jpl 
region  of  U10  had  a  single  adenosine  to  cytidine  transversion 
(Fig.  3),  which  resulted  in  the  substitution  of  an  Ala  residue 
for  Asp-101  (numbering  as  in  ref.  26.  pp.  78  and  204).  This 
mutation  was  confirmed  by  cDNA  sequencing  of  the  U10 
H-chain  V,  D,  and  J  regions.  Since  the  three-dimensional 
structure  of  the  related  McPC603  antibody  suggested  that 
residue  101  was  not  located  in  the  antigen-binding  site,  we 
looked  for  other  changes  in  the  nucleotide  sequence  of  the 
U10  antibody  gene.  The  CH1  region  of  the  U10  H  chain  (Fig. 
3)  was  found  to  be  identical  with  the  germ-line  S107  sequence 
(27).  The  S107  cell  line  produces  two  L  chains  (S107A  and 
S107B),  but  only  the  S107A  L  chain  is  assembled  and 
secreted  with  the  H  chain  in  the  S107  parent  and  the  U10 
mutant  (9).  The  VL,  J.5,  and  C„  regions  of  the  UlOand  S107A 
L  chains  were  identical,  consistent  with  chain-recombination 


Fig.  2.  Direct-binding  ELISA  of  UlOand  S107  protein  to  P-Cho. 
Increasing  concentrations  of  S107  (o),  U10  (•),  and  the  non- 
P-Cho-binding  control  W3129  (a)  antibodies  were  incubated  on 
P-Cho-bovine  serum  albumin-coated  ELISA  plates.  Binding  was 
measured  by  using  subclass-specific  goat  anti-mouse  immunoglob¬ 
ulin  linked  to  alkaline  phosphatase  and  is  expressed  as  OD  at  405  nm 
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S107  TGTCCCAATCTTCACATTCAGAXATCAGCACTCAGTCCTGTCACTATGAAGTTGTGGTTXAACTGGGTTTTTCTTTTaACACTTTTACATGGTATCCAGTGTGAGGTGAAGCTGGTGGAA  120 


010 


HV1 


S107  TCTGGAGGAGGCT'GGTACAGCCTGGGGGTTCTCTGACACTCTCCTGTGCAACrTCTGGGTTCACCTTCAGTGATTTCTACATGGAGTGGoTCCGCCAGCCTCCAGGGAAGAGACTGGAG  2  40 

010  - 


S107  TGGATTGCTGCAAGTAGAAACAAAGCTAATGATTATACAACAGAGTACAGTGCATCTGTGAAGGGTOGGTTCATCGTCTCCAGAGACACTTCCCAAAGCATCCnCTACCTTCAGATGAAT  3  60 

010  - 

j— ►D  | - -  JK  Taql  | - -CH1 

S107  GCCCTGAGAGCTGAGGACACTGCCATTTATTACTGTGCAAGAGATTACTACGGTAGTAGCTACTGGTACTTCGATGTCTGGGGCGCAGGGACCACGGTCACCGTCTCCTCAGAGTCTGCG  4  60 

Asp 

010  - (GCTI - 

Ala 


SI 07  AGAAATCCCACCATCTACCCACTGACACTCCCACCAGTCCTGTGCAGTGATCCCGTGATAATCGGCTGCCTGATTCACGATTACTTCCCTTTCGGCACGATGAATGTGACCTGGGGAAAG  600 

010 - - — - 


SI 07  AGTGGGAAGGATATAACCACCGTGAACTTTCCACCTGCCCTCGCCTCTGGGGGACGGTACACCATGAGCAGCCAGTTAACCCTGCCAGCTGTCGAGTGCCCAGAAGGAGAGTCCGTGAAA  720 
010 - 

1 

S107  TGTTCCGTGCAACATGACTCTAACCCCGTCCAAGAATTGGATGTGAATTGCTCTG  70S 

01 0  - 


Fig.  3.  Sequence  comparison  of  the  rearranged  H-chain  genes  of  U10  and  S107.  The  nucleic  acid  sequence  includes  a  portion  of  the  5' 
untranslated  region  and  the  leader.  The  VH.  D,  Jh.  and  ChI  domains  of  the  U10  and  S107  genes  are  shown  labeled  at  the  first  base  of  each  region. 
The  solid  line  representing  identity  of  the  two  sequences  is  interrupted  by  the  altered  codon  (in  parentheses)  and  the  resulting  amino  acid 
substitution  in  U10.  The  first  (HV1)  and  second  (HV2)  hypervariable  regions  and  the  Tag  1  site  that  is  lost  due  to  the  base  change  are  also  shown. 


experiments,  suggesting  that  the  loss  of  antigen  binding  is 
associated  with  the  H  chain  (28).  Thus,  the  only  sequence 
difference  between  the  S107  and  U10  V  and  adjoining  C 
region  domains  is  the  substitution  of  an  uncharged  Ala  for 
negatively  charged  Asp-101  in  the  fifth  residue  of  JH1. 

Significant  Conformational  Change  in  the  U10  Binding  Site. 
To  examine  the  impact  of  the  U10  mutation  on  V  region 
structure,  we  examined  the  reactivity  of  a  number  of  anti- 
idiotypic  antibodies  with  the  S107  and  U10  antibodies.  Al¬ 
though  eight  different  anti-T15  monoclonal  antibodies,  spe¬ 
cific  for  nonbinding  site  surfaces  of  the  V  domain  (29),  did  not 
distinguish  between  the  U10  and  S107  proteins,  polyclonal 
antibodies  to  the  antigen-binding  site  of  S107  did  (Fig.  4). 
Binding  of  a  polyclonal  antiserum  to  S107  could  be  inhibited 
by  the  addition  of  cither  S107  or  U2U4,  a  single-site  mutant 
of  S107  exhibiting  a  significant  decrease  but  not  a  complete 
loss  of  P-Cho  binding  (30).  In  contrast,  much  higher  concen¬ 
trations  of  the  U10  protein  only  partially  inhibited  S107 
binding  (Fig.  4).  The  same  results  were  obtained  with  three 
different  monoclonal  anti-binding  site  antibodies,  suggesting 
a  conformational  change  in  the  U10  binding  site. 

Computer  Modeling  of  U10.  The  three-dimensional  struc¬ 
ture  of  the  V  region  of  the  U10  antibody  was  analyzed  starting 
from  the  two  crystallographic  structures  of  the  S107-like 
antibody  McPC603  (10,  31).  One  had  P-Cho  bound  and  the 


Fig.  4.  Determination  of  a  conformational  change  in  the  binding 
site  of  U10  by  competition  ELISA.  The  ability  of  U10(o),  S107  (o), 
and  U2U4  U)  proteins  to  inhibit  the  binding  of  a  binding-site-specific 
polyclonal  antiserum  to  S107  protein  was  tested.  Binding  is  ex¬ 
pressed  in  OD  units.  At  the  concentrations  used,  the  polyclonal 
antiserum  is  inhibited  100%  by  P-Cho  (■)  as  indicated. 


other  a  sulfate  ion.  A  model  of  the  S107  V  domain  was  built 
from  the  sulfate-bound  McPC603  structure  (Fig.  5A)  by 
incorporating  appropriate  amino  acid  substitutions.  Calcula¬ 
tions  of  the  electrostatic  potential  on  the  molecular  surface  of 
the  hapten  and  the  V  regions  of  the  McPC603  stmeture  and 
the  S107  model  reveal  strong  electrostatic  complementarity 
between  the  bound  P-Cho  hapten  and  the  antibodies.  These 
results  quantitate  local  electrostatic  stabilization  of  the  hap¬ 
ten  suggested  in  analyses  of  the  crystal  structure  (10).  Res¬ 
idues  contributing  most  to  the  local  electrostatic  complemen¬ 
tarity  with  P-Cho  were  all  in  VH:  Arg-52  with  the  phosphate 
moiety  and  Glu-35  and  Asp-95  with  the  positively  charged 
trimethylammonium  g^oup  (see  Fig.  5A). 

In  the  McPC603  structures,  a  hydrogen-bonded  salt  bridge 
in  VH  between  the  side  chains  of  Asp-101  and  Arg-94  forms 
at  the  surface  of  the  antibody,  with  both  side  chains  extending 
away  from  the  active  site,  as  seen  in  the  model  of  S107  in  Fig. 
5A.  Asp-101  is  11.8  A  distant  from  P-Cho,  and  Arg-94  is  9.3 
A  distant.  Disruption  of  this  salt  bridge  due  to  the  Ala-101 
mutation  of  U 10  could  affect  the  conformation  of  Arg-94  such 
that  the  Arg-94  side  chain  could  extend  into  the  negatively 
charged  pocket  that  binds  the  positively  charged  trimethyl¬ 
ammonium  moiety  of  P-Cho  in  the  nonmutated  antibody. 
Rotation  of  the  whole  length  of  the  Arg  side  chain  by  180° 
would  allow  the  terminal  nitrogen  atoms  to  extend  to  within 
2  A  of  the  P-Cho  nitrogen  in  the  bound  complex. 

To  explore  this  hypothesis,  two  models  of  the  V  region  of 
the  U10  protein,  termed  ARG-IN  and  ARG-OUT.  were  built 
from  the  S107  model  by  substitution  of  Asp-101  with  Ala  by 
side-chain  truncation.  Computer  graphics  followed  by  a 
combination  of  template  forcing,  molecular  dynamics,  and 
full  energy  minimization,  including  explicit  solvent  mole¬ 
cules,  yielded  stereochemically  reasonable  structures.  In  the 
ARG-OUT  model,  the  side-chain  positions  were  maintained 
as  in  the  McPC603  structure,  while  in  the  ARG-IN  model,  the 
Arg-94  side  chain  was  moved  into  the  P-Cho-binding  site 
(Fig.  5 B).  Since  Arg-94  and  Asp-95  are  adjacent  on  a  /3- 
strand,  bringing  Arg-94  into  the  binding  pocket  caused  the 
Asp-95  side  chain  to  extend  out  to  the  surface  away  from  the 
binding  pocket. 

Root-mean-square  deviations  of  both  ARG-OUT  and 
ARG-IN  from  McPC603  framework  main-chain  positions 
were  well  within  the  range  observed  in  immunoglobulin 
crystal  structures  (21),  suggesting  that  main-chain  conforma¬ 
tional  changes  made  in  the  ARG-IN  model  to  accommodate 
the  Arg-94  side-chain  repositioning  are  plausible.  Superpo¬ 
sition  of  individual  immunoglobulin  VL  and  VH  domains  onto 
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Fig.  5.  Computer  graphics  of  the  derived  S107  and  U10  models.  (A)  A  stereo  pair  showing  the  location  and  stereochemistry  of  critical 
charged-residue  side  chains  in  the  computationally  derived  model  of  the  F-Cho-binding  antibody  S107.  The  o-carbon  backbones  for  the  H  (blue, 
bottom)  and  L  (light  purple,  top)  chains  are  shown  together  with  residue  labels  and  bonds  for  positive  (light  blue)  and  negative  (red)  side  chains 
and  F-Cho  (center)  moieties.  The  trimethylammonium  portion  of  F-Cho  is  shown  in  light  blue  (rear  center),  and  the  phosphate  portion  is  shown 
in  red  (front  center).  The  salt  bridge  between  Asp-101  (red  label,  bonds,  and  surface)  and  Arg-94  (light-blue  label,  bonds,  and  surface)  occurs 
on  the  outside  surface  of  the  antibody  (right)  over  9  A  from  the  active  F-Cho-binding  site  formed  at  the  interface  of  the  H  and  L  chains.  (B) 
The  minimized  computationally  derived  models  for  the  ARG-IN  and  ARG-OUT  mutant  structures.  The  o-carbon  backbones  for  the  ARG-IN 
(H  chain,  light  blue,  bottom,  L  chain,  pink,  top)  and  the  ARG-OUT  models  (H  chain,  light  green,  bottom;  L  chain,  purple,  top)  superimpose 
closely  except  for  VH  CDR3.  The  sulfate  (yellow  label,  bonds,  and  surface),  binding-pocket  side  chains  of  Arg-52  (blue  label  and  bonds;  lower 
left)  and  Glu-35  (red  label,  bonds,  and  surface;  lower  center),  and  the  single  site  mutation  at  Ala-101  (light-blue  label  and  bond;  right)  are  shown 
in  the  energetically  favored  ARG-IN  model  only.  The  side  chains  of  Arg-94  and  Asp-95  are  shown  in  both  the  ARG-IN  and  ARG-OUT  models. 
In  the  ARG-OUT  model  (yellow  bonds  and  arrows  from  label),  Arg-94  extends  out  from  the  F-Cho-binding  site  but  has  lost  its  charged  interaction 
with  residue  101  (Ala  in  U10),  while  Asp-95  points  into  the  F-Cho-binding  site  containing  negatively  charged  Glu-35.  In  the  ARG-IN  model, 
Arg-94  (light-blue  bonds,  label,  and  arrow)  extends  into  the  binding  site,  and  Asp-95  (red  bonds,  label,  and  arrow)  extends  to  the  outside  surface 
and  points  away  from  the  F-Cho-binding  site. 


the  corresponding  domains  of  McPC603  gave  root- 
mean-square  deviations  varying  from  0.48  A  to  1.69  A  forVL 
and  from  0.80  A  to  1.29  A  for  VH,  with  values  for  ARG-IN 
of  0.48  A  for  VL  and  0.72  A  for  VH  and  for  ARG-OUT  of  0.48 
A  for  VL  and  0.56  A  for  VH.  Moreover,  computer  graphic 
examination  of  10  superimposed  VL  domains  and  7  Vti 
domains  indicated  that  H-chain  sites  in  HyHEL5  (residue  8). 
HEdlO  (residue  44),  and  Newm  (residue  73)  as  well  as  the 
L-chain  site  in  Kol  (residue  12)  showed  greater  framework 
variability  than  in  our  ARG-IN  model  of  U10  at  H-chain 
residues  94-95. 

Role  of  Arg-94  in  Binding  Specificity.  To  understand  the 
differences  between  the  ARG-IN  and  ARG-OUT  models  in 
more  detail,  energetics  for  three  critical  combining  site  res¬ 
idues  (Arg-94,  Asp-95,  and  Glu-35  of  VH)  were  calculated 
(Table  1).  To  bracket  the  electrostatic  contribution,  the 
energetics  were  calculated  with  a  dielectric  constant  (e)  of 
both  1  (probably  overestimates  the  electrostatic  contribution) 
and  4  (underestimates).  The  energetics  of  Arg-94  (Table  1), 
more  favorable  by  1.4  keal-mol"1  ( e  =  4)  to  4.0  kcalmol'1  (e 
=  1)  for  the  ARG-IN  model  compared  with  ARG-OUT, 
reflect  that  the  water-solvated  Arg-94  side  chain  of  the 
McPC603-like  model  ARG-OUT  extends  away  from  the 
F-Cho-binding  site  with  no  formation  of  new  intramolecular 
hydrogen-bonding  interactions,  whereas  the  Arg-94  side 
chain  of  ARG-IN  extends  into  the  active  site  and  forms  two 
hydrogen-bonded  salt  bridges,  one  with  the  sulfate  ion  and 
one  with  VH  Glu-35  (Fig.  5 B).  The  energetics  of  Asp-95  are 
much  more  favorable  in  the  ARG-IN  model  by  32.6 
kcal-mol'1  for  t  -  1,  while  the  energetics  are  slightly  less 
favorable  by  0.8  kcal-mol'1  for  t  =  4.  Thus,  the  ARG-IN 
model  indicates  that  there  is  increased  stability  of  the  U10 
mutant  when  the  Arg-94  side  chain  is  positioned  in  the 


negative  potential  of  the  binding-site  pocket  while  the  Asp-95 
side  chain  extends  out  and  away  from  the  pocket  into  solvent . 

Table  1.  Energetics  of  H-chain  Arg-94,  Asp-95,  and  Glu-35  for 
the  two  U10  mutant  models  ARG-IN  and  ARG-OUT 


Potential  energy,  kcal  mol  1 


t  =  I 

e  =  4 

Arg-94 

Asp-95 

Glu-35 

Arg-94 

Asp-95 

Glu-35 

VL 

48.3 

-66.5 

ARG-IN 

-67.9 

7.7 

-17.6 

-18  1 

Vh 

-109.7 

-78.2 

—  142.4 

-23.7 

- 18.6 

-34.9 

Sulfate 

- 115.4 

43.8 

71.5 

-29.3 

11.0 

17.9 

Waters 

46.4 

-93.2 

-38.1 

11.0 

- 19.4 

-9.8 

Total 

-130.4 

-194.1 

-176.9 

-34.3 

-44  6 

-44.9 

VL 

57.9 

-61.5 

ARG-OUT 

-74.4 

13.9 

-17.9 

-20.6 

Vh 

-119.8 

-96.0 

-93.7 

-31.5 

-27.6 

-27.6 

Sulfate 

-38.1 

66.0 

64.0 

-9.5 

16.5 

16.0 

Waters 

-26.4 

-70.0 

-52.5 

-5.8 

-16.4 

-13.5 

Total 

-126.4 

-161.5 

-156.6 

-32.9 

-45.4 

-45.7 

To  bracket  the  electrostatic  contribution,  potential  energies  were 
calculated  for  both  a  dielectric  of  1  (r  =  1)  and  a  dielectric  of  4  (r  = 
4).  The  potential  energy  of  each  of  these  residues  with  the  VL  chain, 
the  sulfate  ion,  and  the  waters  is  due  to  nonbond  interactions,  the 
sum  of  electrostatic  and  van  der  Waals  energies.  For  example,  the 
interaction  between  Arg-94  and  the  VL  chain  is  the  sum  of  the 
interactions  between  each  atom  of  Arg-94  and  each  atom  of  the  VL 
chain.  The  potential  energy  of  each  residue  with  the  VH  chain  is  the 
sum  of  the  nonbond  energies  with  all  residues  in  the  VH  chain  plus 
the  internal  energy  (strain  in  the  bond,  angle,  and  torsion  angle 
energies)  calculated  for  that  specific  residue. 
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In  the  ARG-IN  model,  the  VH  Glu-35  side  chain  changes 
conformation  to  allow  salt-bridge  formation  with  the  Arg-94 
side  chain  (shown  as  intersecting  surfaces  in  Fig.  5B).  Glu-35 
forms  this  salt  bridge  while  maint  -ig  its  hydrogen  bond  to 
the  side  chain  of  L-chain  Tyr-9  u-35  does  not  form  any 
salt  bridges  in  the  crystal  stnK  „■  of  McPC603  or  in  the 
minimized  ARG-OUT  structure.  For  Glu-35  (Table  1),  the 
salt-bridged  ARG-IN  model  is  energetically  more  favorable 
(by  20.3  kcal-mol  *)  for  e  =  1,  whereas  for  e  =  4,  it  is  slightly 
less  favorable  (by  0.8  kcal-mol"1).  Thus,  having  the  Arg-94 
side  chain  in  the  active  site  puts  Glu-35,  a  residue  known  to 
be  important  in  P-Cho  binding  (30.  32),  in  a  more  energeti¬ 
cally  favorable  position. 

Among  related  antibodies  (ref.  26,  pp.  204-208)  with 
sequence  changes  for  Asp-101,  several  in  subgroup  111  A  have 
replacements  of  Asp-101  (to  Ala  or  Gly),  while  retaining 
Arg-94.  Only  one  of  these  (CBA/J  6G6)  is  known  to  bind 
P-Cho.  If,  as  we  have  explored  above,  the  Asp-101  residue  is 
necessary  to  form  a  salt  bridge  with  Arg-94  so  that  P-Cho 
binding  is  retained,  how  can  we  explain  this  exception? 
CBA/J  6G6  has  a  much  shortened  CDR3  between  Arg-94  and 
Ala-101  and  has  Asp-98  two  residues  before  Ala-101,  sug¬ 
gesting  that  Asp-98  can  replace  Asp- 101  in  the  salt  bridge  to 
Arg-94. 

Role  for  Side-Chain  Rearrangements  in  Antibody  Diversity. 
We  have  found  that  a  single  base  pair  mutation  in  the  JhI  gene 
of  the  S107  VH  region,  corresponding  to  an  amino  acid 
outside  and  distant  from  the  active  site,  completely  elimi¬ 
nates  antigen  binding  in  the  somatic  mutant  U10.  Similar 
observations  have  been  reported  in  anti-lysozyme  antibodies 
(33).  Our  results  and  observations  by  others  of  substitutions 
in  JH1  (1,  34)  and  in  JH3  and  JH4  (35)  suggest  that  residues 
distant  from  the  binding  site  may  play  an  important  role  in  the 
specificity  and  affinity  of  the  antigen-binding  site.  The  loss  of 
P-Cho  binding  and  serological  studies  with  binding-site- 
specific  antibodies  indicate  a  significant  conformational 
change  local  to  the  antigen-binding  site  in  U 10  greater  than 
that  created  by  the  Glu-35  to  Ala  substitution  in  the  binding 
site  of  the  single-site  S107  mutant  U2U4. 

Our  two  models  of  the  U10  antibody,  ARG-IN  and  ARG- 
OUT,  differ  in  placement  of  the  Arg-94  side  chain.  In  the 
ARG-IN  model,  the  Arg  guanidinium  group  lies  in  the  posi¬ 
tion  that  would  be  occupied  by  the  trimethylammonium 
group  of  P-Cho  and  results  in  more  favorable  energies  for 
Arg-94,  Asp-95,  and  Glu-35  compared  with  the  ARG-OUT 
model  (Table  1).  Thus,  a  role  of  the  Asp-101  side  chain  in 
McPC603  and  S107  may  be  to  keep  the  Arg-94  side  chain  out 
of  the  negatively  charged  binding  site  by  forming  a  salt  bnuge 
to  it  on  the  exterior  of  the  antibody.  Lack  of  the  Arg-94  to 
Asp-101  salt  bridge  in  the  U10  mutant  may  allow  the  Arg-94 
side  chain  to  be  folded  into  the  active  site,  as  modeled  by 
ARG-IN ,  explaining  the  complete  loss  of  P-Cho  binding  with 
little  overall  conformational  change. 
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